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A video conferencing system in which a participant may change the camera position or zoom by using a pointing device, 
such as a mouse, to position a cursor on the video display screen. In one embodiment, the user can control the panning and tilting 
of the camera by positioning the cursor on one of four arrowheads located on the four edges of the video display screen. The arro- 
wheads are outlines, transparent inside the lines, overlaid onto the video conference image. When a user moves the cursor into a 
predefined active area surrounding the arrowheads, the entire arrowhead changes to an appearance (e.g. bright color) to indicate 
that it is active. Once the cursor is in the active area, the user can press a button on the pointing device to move the camera in the 
desired direction. If the user holds down the button, the camera continues to move in the chosen direction. If the user clicks the 
button, the camera moves in short increments in the chosen direction. This clicking method allows the user to make small adjust- 
ments in the camera's position, whereas holding down the button on the pointing device enables the user to make larger changes 
in camera position quickly and efficiently. Using this technique of camera control, a participant in a video conference can point 
the cursor at the top arrow to move the camera up, the bottom arrow to move the camera down, and so on. 
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METHOD AND APPARATUS FOR ON-SCREEN CAMERA 
CONTROL IN VIDEO- CONFERENCE EQUIPMENT 

Background of the Invention 
Field of the Invention 

This invention relates to video conferencing systems. 
Related Art 

Videophones and video conferencing systems are 
10 becoming increasingly popular. Through the use of a video 
conferencing system, conferees at a variety of locations 
can have meetings and pass both video and audio information 
over the public telephone lines. Typically, a video 
conferencing system will include one or more cameras, 
15 microphones, speakers and displays disposed at each 
conference location. By transmitting control information 
over the telephony lines, conferees at any location can 
control the cameras, microphones, speakers and displays 
both at their own location and at the locations of the 
20 other parties. For example, by operation of a control 
panel, a conferee at location "A" can pan the camera at 
location "B" and then zoom in on a particular person or 
object. An example of a prior art video conferencing 
system is the PictureTel System 4000 (manufactured by 
25 PictureTel Corporation of Danvers, Massachusetts) . In 
video conferencing systems such as the System 4000, 
conference control is provided by way of a control box 
which usually rests on a table. Camera control is 
performed by repeatedly pressing a button on the control 
30 box, representing a direction (up, down, right, left), or 
by physically moving the camera to point in the desired 
direction. 

Although the above-described system provides a 
functional solution to camera control, using a button on a 
35 box located on a table in front of the participant requires 
the participant to take his eyes off the image on the 
screen to locate the button, and then to repeatedly press 
the button in order to move the camera in the desired 
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direction. If the participant moves the camera too far, 
say, to the left, he may again need to look down at the 
button box to find the button for moving the camera to the 
right. These small adjustments can require repeated 
5 actions distracting to both the person adjusting the camera 
as well as to the participants at the remote end of the 
conference. 

Another common method for controlling the position of 
a camera in video conference equipment is to manually move 

10 the camera so that it points in the desired direction. 
This requires sitting within arm's reach of the camera or 
getting up during a conference to adjust the camera. 
Further, it takes considerable attention to physically turn 
the camera and may, at least momentarily, obscure the image 

15 during adjustment. 

II. Summary of the Invention 

This invention provides an intuitive and natural means 
for controlling a camera being used as part of video- 

20 conference equipment. A video conference participant 
controls the camera by using a pointing device, such as a 
mouse, to position a cursor on the video display screen. 

In one embodiment, the user can control the panning 
and tilting of the camera by positioning the cursor on one 

25 of four arrowheads located on the four edges of the video 
display screen. The arrowheads are outlines, transparent 
inside the lines, overlaid onto the video conference image. 
When a user moves the cursor into a predefined active area 
surrounding the arrowheads, the entire arrowhead changes 

30 appearance (e.g. to a bright color) to indicate that it is 
active. Once the cursor is in the active area, the user 
can press a button on the pointing device to move the 
camera in the desired direction. If the user holds down 
the button, the camera continues to move in the chosen 

35 direction. If the user clicks the button, the camera moves 
in short increments in the chosen direction. This clicking 
method allows the user to make small adjustments in the 
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camera's position, whereas holding down the button on the 
pointing device enables the user to make larger changes in 
camera position quickly and efficiently. Using this 
technique of camera control, a participant in a video 
5 conference can point the cursor at the top arrow to move 
the camera up, the bottom arrow to move the camera down, 
and so on. This action feels much the same as, for 
instance, pointing the lens of a video camera at an object 
to be video-taped* 

10 

III. Brief Description of the Drawings 

FIG. 1 shows the video portion of a video conferencing 
system according to an embodiment of the present invention; 

FIG, 2 is an illustration of a display screen having 
15 a graphics overlay with directional control arrows 
according to an embodiment of the present invention; 

FIG. 3 is an illustration of the display screen of 
FIG. 2 after a directional arrow has been selected; 

FIG* 4 is a flowchart illustrating the operation of 
20 the processor of FIG. 1 in performing the video graphics 
overlay functions; 

FIG. 5 is a flowchart illustrating the operation of a 
re-center and zoom operation of the processor of FIG. 1 
according to an embodiment of the present invention; 
25 FIG. 6 is a flowchart illustrating the operation of 

re-center and zoom operations of the processor of FIG. 1 
according to an alternative embodiment of the present 
invention. 

Like numbered reference numerals appearing in more 
30 than one figure represent like elements. 

IV. Detailed Description of the Preferred Embodiment 
This invention provides the means to adjust a camera 

with minimal distraction to the participants. By using a 
35 mouse, a participant can easily move the cursor towards the 
appropriate arrow. Once the cursor enters the active area 
for an arrow, the arrowhead lights up to show it is active. 
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As soon as the cursor is in an active area, the user can 
either cause the camera to move continuously by holding 
down the mouse button or he can adjust the camera positions 
in small increments by pressing and releasing the mouse 
5 button until the camera reaches the desired spot. Either 
method can be done without taking ones eyes off the screen. 
Advantageously, the action of pointing the cursor in the 
direction one wishes the camera to move is a natural and 
intuitive means of controlling the camera. 

10 FIG. 1 shows the video portion of a video conferencing 

system according to an embodiment of the present invention. 
Two conference stations 100, 101 are illustrated, although 
it should be understood that more stations can be connected 
into the video conference by way of the digital telephony 

15 communication network 102. Each conference station 100, 
101 includes a conventional television camera 104 (104A in 
conference station 101) and its associated control and 
positioning motors 106. Each station also includes a 
conventional processor 108 having a pointing device 110 

20 such as a mouse. The processor 108 is connected to a video 
graphics interface 112 which is, in turn, connected to a 
video mixer 114. The video graphics interface 112 converts 
digitally encoded graphics display data generated by the 
processor 108 into analog video signals. The processor 108 

25 and the video graphics interface can be embodied, for 
example, as an IBM compatible 80X86 based computer with a 
Video Graphics Adaptor (VGA) card. Alternatively, the 
video conferencing system of FIG. 1 could be, for example, 
embodied by modifying the processor firmware of an existing 

30 PictureTel System 4 000 as described herein, and by 
providing the System 4000 processor with the pointing 
device hardware and handlers. 

Each station also includes a conventional video 
coder/decoder (Video CODEC) 116 of a type having an 

35 internal video data buffer. The Video CODEC 116 is 
connected to the communications network by way of a digital 
telephony communications link 118. The Video CODEC 116 
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sends and receives encoded digital video information from 
the digital telephony communications network 102 and 
converts the information into analog video signals. A 
video mixer 114 is connected to both the Video Codec 116 
5 and the Video Graphics Interface 112- The video mixer 
receives the analog video signals from Video CODEC 116 and 
mixes them with the analog video signals from the video 
graphics interface 112. Thus, the video mixer generates a 
combined video image comprising the conference image from 
10 the Video CODEC 116 overlaid with the graphics video 
generated by the video graphics interface 112. This 
combined video image is displayed on a conventional video 
display 115. 

The Video CODEC 116 is also connected to the processor 

15 108 and the television camera 104 and its positioning 
motors 106. The Video CODEC converts digital camera 
control information received by way of the telephony 
network 102 or the processor 108 and converts it into 
analog camera positioning signals* These signals are sent 

20 to the camera positioning motors 106 which control the 
position of the camera (e.g. pan and tilt) and the camera 
positive and negative zoom functions. The Video CODEC 116 
also receives television signals from the camera 104 and 
converts them into digital video signals for transmission 

25 over the communications network. Either processor 108 can 
control the camera within its own video conferencing 
station and, through the Communication Network 102, it can 
also control the far end (remote) camera. For example, the 
processor 108 in the first video conferencing station 100 

30 can control its own camera 104 as well as the camera 104A 
in the second video conferencing station 101. The 
processor 100 controls which video conferencing station 
100, 101 is selected as a source/destination for the Video 
CODEC 116 by sending the Video CODEC 116 appropriate 

35 routing control information. The user determines which 
conference station* s image is to be displayed (and informs 
the processor 108) by way of a selection on a menu bar. 
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The Video CODEC 116 sends the camera control information 
generated by the processor 108 to the conferencing station 
whose image is currently being displayed. 

The appearance of the screen of the video display 
5 according to an embodiment of the present invention is 
illustrated in FIGs. 2 and 3. The processor 108 generates 
camera control arrows 202A-D which are overlaid on a video 
conference image 204 (received from the Video CODEC 116) . 
The dotted lines surrounding the arrowheads 202A-D do not 

10 appear on the screen. They are included here to indicate 
the* approximate position and size of the active areas. 

FIG. 3 illustrates the change in a camera control 
arrowhead once the cursor enters the active area around the 
arrowhead. When a user moves the mouse, the cursor (e.g. 

15 cross-hairs) are correspondingly moved on the screen. When 
the processor detects that the cursor has been moved within 
the active area around an arrowhead, the processor 
highlights that arrowhead and turns off all of the other 
arrows. When the processor detects depression of a button 

20 on the pointing device, it generates camera positioning 
control signals which it sends to the Video CODEC 116. 
Depending on whether the system is set up for remote and/or 
local camera control, these control* signals are sent either 
to the communications network (where they are used to 

25 control the far end camera position motors at the target 
remote conference station) or to the local camera 
positioning motors. 

FIG. 4 is a flow chart of the graphic overlay control 
software for the processor of FIG. 1. In step 402 the 

30 processor determines the cursor position on the Video 
Display 115. This position is controlled by a user by way 
of the mouse 110. As is conventional, the processor 
constantly monitors the mouse and displays the cursor at 
the appropriate position on the video display screen. In 

35 steps 404 through 410, the processor determines whether the 
cursor is within one of the active regions for the 
displayed directional arrows and if so, determines in which 
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arrows region the cursor is displayed. Next, in steps 
412*418, if the cursor is in the region of one of the 
arrows, the processor highlights that arrow and turns off 
highlighting on all of the others. Next, in steps 420-426 , 
5 the processor determines if the pointing device button is 
being held down (a single button mouse will be assumed here 
although the processor could just as readily look for 
depression of a particular button on a multi-button mouse) . 
If so, in steps 428-434, the camera is moved one step in 

10 the direction of the arrow. If the pointing device button 
is not being pressed, the processor returns to step 402. 
If, in steps 404-410, it is determined that the cursor is 
not within one of the active regions, in step 436, the 
highlighting on all arrows is turned off. J 

15 This method of camera control can also be implemented 

using a keyboard with directional arrows to position a 
cursor on the displayed arrowheads or with other pointing 
devices such as a trackball instead of a mouse. Further, 
a touch-screen monitor can be used to display the video 

20 image. In that case, a user would touch the screen in the 
active area surrounding the arrowhead in order to move the 
camera . 

The camera control arrowhead display can be 
implemented in several ways. For example, a pure hardware 

25 implementation can be used where dedicated hardware in the 
video circuitry places the camera control arrows at fixed 
places on the video display. Another approach is to 
provide the processor with software which writes data 
directly into the screen buffer of the Video CODEC in order 

30 to display the arrowheads on the video screen. A third 
approach is to provide the processor with a video graphics 
overlay which impresses the camera control arrows over the 
standard video display by way of the video mixer. 

A similar method can be used to control zoom function. 

35 One such method is to use a three button pointing device. 
In this embodiment one button on the pointing device 
controls camera pan and tilt in accordance with the 
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selected direction arrow (as previously described) , a 
second button on the pointing device causes the camera to 
zoom-in and the third button on the pointing device causes 
the camera to zoom-out. 
5 in an alternative embodiment, the zoom-in and zoom-out 

buttons can be used to cause the camera to zoom in or out 
around a specific point selected by the pointing device. 
In this embodiment, depression of a first button on a three 
button pointing device causes the processor to generate pan 

10 and tilt control data that will cause the active camera 
(the camera whose image is being displayed) to re-center on 
a selected point (the point on which the cursor is 
located) . Depression of the second or third buttons on 
the pointing device activate, respectively, a combined re- 

15 center and zoom-in or re-center and zoom-out operation. In 
order to use the re-center and zoom function the user 
positions the cursor (by using the pointing device) in the 
center of the desired image and then presses the 
appropriate button (zoom in or zoom out) . 

20 The operation of the re-center and zoom control 

software in the processor 108 of FIG. 1, according to the 
above-described embodiment, is illustrated in FIG. 5. In 
step 502 the processor determines if any of the three 
pointing device buttons are being depressed. If none of 

25 the buttons are being depressed the processor returns to 
step 502. If any one of the buttons are being depressed, 
in step 504, the processor 108 determines the X,Y 
coordinates of the cursor position. Next, in step 506 the 
processor calculates the difference in position (delta X 

30 and delta Y) between the cursor position and the center of 
the displayed conference image. Then, in step 508 the 
processor uses the delta X and delta Y values to generate 
the appropriate pan and tilt control information to cause 
the image to be re-centered around the cursor location and 

35 provides this control information to the Video CODEC 116. 
The pan and tilt control information can be determined by 
the use of a look up table wherein each entry in the table 
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corresponds to an appropriate number and direction of pan 
and tilt steps for a given delta X and delta Y. The 
specific table values can be precoded into the look-up 
table if the camera/motor types at each station are known 
5 or can be exchanged by each station providing the other 
with its camera/motor control parameters over the 
communication link. 

In steps 510, 512 the processor determines whether one 
of the zoom-in or zoom-out buttons is being depressed. If 

10 not, the processor returns to step 502. If in step 510 it 
is determined that the zoom- in button is being depressed, 
in step 514 the processor generates control signals for one 
zoom-in step (stepping control of the zoom camera motors is 
assumed here) , provides them to the Video CODEC 116 and 

15 returns to step 502. Similarly, if in step 512 it is 
determined that the zoom-out button is being depressed, in 
step 516 the processor generates control signals for one 
zoom-out step, provides them to the Video CODEC 116 and 
returns to step 502. Just as with the highlighted arrow 

20 pan/tilt control (previously described) , if the zoom-in or 
zoom-out button is held down continuously, the processor 
will, correspondingly, continuously generate zoom-in or 
zoom-out control step signals until the button is released. 
As an alternative to the above-described embodiment 

25 the zoom-in and zoom-out control can be de-coupled from the 
centering controls. This embodiment is illustrated in FIG. 
6. As with the embodiment of FIG. 5, in step 502 the 
processor determines if any of the three pointing device 
buttons are being depressed. If none of the buttons are 

30 being depressed the processor returns to step 502. If any 
one of the buttons are being depressed, in step 602 the 
processor determines if the "re-center" button (e.g. the 
far left button) has been depressed. If yes, the processor 
108 determines the X,Y coordinates of the cursor position 

35 in step 504, calculates the delta X, delta Y values 
relative to the center of the displayed image in step 506 
and then, in step 508, generates appropriate pan and tilt 
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control signals to re-center the image. If the re-center 
button was not depressed, the processor performs step 510 
in which the processor determines whether one of the zoom- 
in button (e.g. the center button on the pointing device) 
5 is being depressed. If in step 510 it is determined that 
the zoom-in button is being depressed, in step 514 the 
processor generates control signals for one zoom-in step 
(stepping control of the zoom camera motors is assumed 
here) , provides them to the Video CODEC 116 and returns to 

10 step 502. If the zoom- in button is not being depressed, in 
step 516 the processor generates control signals for one 
zoom-out step, provides them to the Video CODEC 116 and 
returns to step 502. It is noted that the processor can 
determine that the zoom-out button was depressed since the 

15 pointing device is known to be a three button device and 
the other two buttons were eliminated as choices in steps 
602 and 510. 

It will be appreciated that in the embodiment of FIG. 
5, the user can re-center the image around a selected point 

20 and either zoom in, zoom-out or take no further action, by 
depressing a single button on the pointing device. In the 
embodiment of FIG. 6, the functions of re-center, zoom-in 
and zoom-out are each accomplished independently by a 
separate button. 

25 Either of the embodiments of FIG* 5 or 6 can be used 

in conjunction with the directional arrows of FIGs. 2-4. 
In such an implementation, the processor first determines 
if the cursor is within one of the active regions in or 
around an arrow. If so, the method of FIG. 4 is performed. 

30 If not, the method of either FIG. 5 or FIG. 6 is performed 
(whichever has been implemented) . 

In another alternative embodiment, the user can use 
the pointing device to draw a selection border (preferably 
rectangular) around the desired image. The processor 

35 software then changes the camera position so as to re- 
center the image (around the center of the selected area) 
and changes and zoom factor to cause the selected image to 
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fill the display screen. In order to determine the 
appropriate zoom factor, the processor calculates the 
ratio (dl/d2) of the diagonal (dl) of the full displayed 
image over the diagonal (d2) of the selected image. 
5 Control software can he used to ensure that the relative 
dimensions of the selected area are of the same proportion 
as the displayed image (alternatively, the processor can 
just re-center and use the diagonal data as is) . Both the 
zoom and pan and tilt control information can be determined 

10 by the use of a look up table. The use of a look up table 
for re-centering has been previously described. In a 
similar manner, for the zoom factor, each entry in the 
table corresponds to an appropriate number of zoom steps 
for a given dl/d2 ratio. The specific table values can be 

15 precoded into the look-up table if the zoom motor types at 
each station are known or can be exchanged by each station 
providing the other with its zoom motor control parameters 
over the communication link* As described with respect to 
the embodiments of FIG. 5 and FIG. 6, this method of 

20 performing a re-center and zoom operation can be used in 
conjunction with the control arrows of FIGs. 2-4. 

As is conventional, the processor may be provided with 
additional Graphic User Interface (GUI) software which 
enables a user to control various conference features (e.g. 

25 far/near camera select, volume control ...) by way of menu 
bars displayed along the periphery of the video display 
screen. These menus can be implemented such that they can 
be hidden or displayed under user control. When the menus 
are displayed, the video image is compressed into the 

30 slightly smaller space remaining on the display screen. 

Now that the invention has been described by way of 
the preferred embodiment, various enhancements and 
improvements which do not depart from the scope and spirit 
of the invention will become apparent to those of skill in 

35 the art. Thus it should be understood that the preferred 
embodiment has been provided by way of example and not by 
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way of limitation. The scope of the invention is defined 
by the appended claims. 
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CLAIMS : 

!• A camera control mechanism for a video 
conferencing system, comprising: 

first means for receiving video signals from a 
5 communications network and generating a video image 
therefrom; 

second means for generating a graphic overlay and for 
superimposing the graphic overlay on the video image; 

third means for manipulating a cursor on the video 
10 image; 

fourth means, coupled to the first means, the second 
means and the third means, for determining a position of 
the cursor on the video image and for generating control 
signals to move a camera providing a source of the video 
15 image in a direction indicated by a position of the cursor 
on the video image. 

2. The apparatus of Claim 1 wherein the third means 
comprises a pointing device having a control button and 

20 wherein the processing means generates the camera control 
information responsive to the position of the cursor at the 
time of depression of the control button by the user. 

3. The apparatus of Claim 2 wherein the pointing 
25 device is a mouse. 

4. The apparatus of Claim 2 wherein the graphics 
overlay comprises directional arrows and wherein the second 
means displays the video image overlaid with the 

30 directional arrows. 

5. The apparatus of Claim 4 wherein the fourth means 
comprises means for determining whether the position of the 
cursor is in proximity to one of the directional arrows and 

35 for generating directional motor control corresponding 
thereto when the control button is depressed and it is 



WO 94/07327 PCT/US93/07948 

14 

determined that the cursor is in the proximity of the one 
of the directional arrows. 

6. The apparatus of Claim 5 wherein the fourth means 
5 comprises means for highlighting the one of the directional 

arrows. 

7. A video conferencing system, comprising: 

a video encoder/decoder,/ the video encoder/decoder 
10 including means for interfacing with a communication 
network ; 

a camera connected to supply video signals to the 
video encoder/decoder; 

a camera positioning motor connected to the camera and 
15 connected to receive camera positioning information from 
the video encoder/decoder; 

processing means for generating digitally encoded 
video display data and for generating digital motor control 
information, the processing means being connected to the 
20 digital motor control information to the encoder/decoder; 

a video graphics interface connected to receive the 
encoded video display data from the processor; 

a video mixer, connected to receive graphic video 
signals from the video graphics interface and live image 
25 video signals from the video encoder/decoder; 

a video display, coupled to the video mixer, for 
displaying the live video image overlaid with the graphic 
video; and, 

a pointing device coupled to the processor; the 
30 pointing device comprising a push button switch and signal 
means for indicating when the push button switch has been 
depressed by a user; 

wherein, the processing means further comprises 
position detection and display means, for causing a cursor 
35 to be displayed on the video display at a position 
responsive to user control of the pointing device, and for 
generating different motor control signals dependent on a 
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calculated position of the cursor on the live video image 
when the push button switch is depressed by the user ♦ 

8. The apparatus of Claim 7 wherein the graphics 
5 image comprises directional arrows and wherein the video 

mixer displays the live image overlaid with the directional 
arrows . 

9. The apparatus of Claim 8 wherein the processing 
10 means comprises means for determining whether the position 

of the cursor is in proximity to one of the directional 
arrows and for generating directional motor control 
corresponding thereto when the push button switch is 
depressed and it is determined that the cursor is in the 
15 proximity of the one of the directional arrows. 

10. The apparatus of Claim 9 wherein the processing 
means comprises means for highlighting the one of the 
directional arrows. 

20 

11. The apparatus of Claim 7 wherein the processing 
means further comprises means for causing a selected camera 
to zoom in on a point of the 1 ive video image sub j ect 
responsive to user manipulation of the cursor on the live 

25 video image. 

12. A method of controlling a camera in a video 
conferencing system, comprising the steps of : 

receiving video signals from a communications network 
30 and generating a live video image therefrom; 

manipulating a cursor on the live video image; 
positioning a cursor on the live video image using a 
pointing device having at least one control switch; 

determining when the control switch has been actuated 
35 and, in response, determining a present position of the 
cursor on the live video image; 



WO 94/07327 FCT/US93/07948 

16 

in further response to actuation of the control 
switch, generating camera control signals to reposition a 
camera providing a source of the live video image so as to 
re-center the live image around the present position of the 
5 cursor* 

13. The method of Claim 12 comprising the further 
step of: 

in further response to actuation of the control 
10 switch, generating camera control signals to cause the 
camera providing the source of the live video image so as 
to perform a zoom- in step on the live video image. 

14. The method of Claim 12 comprising the further 
15 step of: 

in further response to actuation of the control 
switch, generating camera control signals to cause the 
camera providing the source of the live video image so as 
to perform a zoom-out step on the live video image. 

20 

15. The method of Claim 12 comprising the further 
steps of: 

generating a graphic overlay comprising a plurality of 
directional arrows; 

25 superimposing the graphic overlay on the live video 

image ; 

defining an active region around each of the 
directional arrows? 

responsive to the switch being actuated determining if 
30 the cursor is in one of the active regions; 

if the cursor is in one of the active regions, 
generating camera control signals to move the camera 
providing the source of the live video image in a direction 
of a directional arrow within the one of the active 
35 regions, instead of re-centering the live video image 
around the present position of the cursor. 
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16. A method of controlling a camera in a video 
conferencing system, comprising the steps of : 

receiving video signals from a communications network 
and generating a live video image therefrom; 
5 generating a graphic overlay comprising a plurality of 

directional arrows; 

superimposing the graphic overlay on the live video 
image ; 

defining an active region around each of the 
10 directional arrows; 

manipulating a cursor on the live video image; 

positioning a cursor on the live video image using a 
pointing device having at least a first control switch; 

determining when the control switch has been actuated 
15 and, in response, determining a present position of the 
cursor on the live video image; 

responsive to the control switch being actuated 
determining if the present position of the cursor is in one 
of the active regions; 
20 if the present position of the cursor is in one of the 

active regions, generating camera control signals to move 
the camera providing the source of the live video image in 
a direction of a directional arrow within the one of the 
active regions. 

25 

17. The method of Claim 16, comprising the further 
steps of : 

determining when a second control switch on the 
pointing device has been actuated; and, 
30 responsive to the second control switch being actuated 

generating camera control signals to cause the camera 
providing the source of the live video image to change a 
zoom factor of the live video image. 



WO 94/07327 PCT/US93/07948 

18 

18. A video conferencing system, comprising: 

means for receiving video signals from a 

communications network and generating a live video image 

therefrom; 

5 means for generating a graphic overlay comprising a 

plurality of directional arrows; 
a video display; 

means, coupled to the video display, the means for 
receiving and the means for generating, for superimposing 
10 the graphic overlay on the live video image; 

means for defining an active region around each of the 
directional arrows; 

a pointing device for manipulating a cursor on the 
live video image, the pointing device having at least a 
15 first control switch; 

means, coupled to the pointing device, for determining 
when the control switch has been actuated and, in response, 
determining a present position of the cursor on the live 
video image; 

20 means, coupled to the pointing device, the means for 

defining, and the communications network, for determining 
if the present position of the cursor is in one of the 
active regions responsive to the control switch being 
actuated, and if the present position of the cursor is in 

25 one of the active regions, generating camera control 
signals to move the camera providing the source of the live 
video image in a direction of a directional arrow within 
the one of the active regions. 
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